LENGTH DETERMINATION OF NUCLEIC ACID REPEAT SEQUENCES BY 
DISCONTINUOUS PRIMER EXTENSION 



CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation of Application Serial No. 09/205,1 14, filed December 3, 
5 1998, which is a continuation-in-part of Application Serial No. 08/863,437, filed May 27, 1997, 
which are all incorporated herein by reference. 

FIELD OF THE INVENTION 
This invention relates to methods and kits useful for determining the length of nucleic 
acid repeat sequences. More specifically, this invention relates to methods and kits useful for 
10 determining the length of nucleic acid repeat sequences by employing a discontinuous primer 
extension reaction. 
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BACKGROUND 

Methods for the analysis of genetic polymorphism have found wide utility in basic 
research, clinical diagnostics, forensics, and other areas. One particularly useful method of 
detecting genetic polymorphism is based on variations in the length of repeat sequences, such 
methods being variously referred to as short tandem repeat analysis (STR), variable number of 
tandem repeat analysis (VNTR), minisatellite analysis, and microsatellite analysis. 

Detection of length polymorphisms in nucleic acid repeat sequences has up to now relied 
on gel electrophoresis for the determination of the length of the repeat sequence. However, gel 
electrophoresis has several important drawbacks in the context of repeat sequence length 
polymorphism analysis. First, molecular length measurements based on electrophoretic mobility 
are inherently imprecise due to a complicated relationship between molecular size and 
electrophoretic mobility. Second, the degree to which the electrophoretic process can be 
multiplexed is limited by the number of electrophoresis lanes and by the size of different loci run 
in a single lane, i.e., loci must be selected which do not electrophoretically co-migrate. 



SUMMARY 

The method of the present invention comprises a discontinuous primer extension reaction 
wherein a primer is extended in discrete increments such that in each increment of primer 



extension the primer is extended by an amount corresponding to a single repeat unit. Following 
each increment of discrete primer extension, a detection step is performed in which a modulation 
in a signal is detected when the primer has been extended by an amount equal to the total length 
of a repeat region. Thus, by counting the number of increments of discrete primer extension 
required to cause a modulation in the signal, the number of repeat units making up the repeat 
region is determined. 

It is an object of the present invention to provide a precise and reproducible method for 
determining the number of repeat units making up a repeat region of a nucleic acid repeat 
sequence. 

It is another object of the present invention to provide a method for determining the 
number of repeat units making up a repeat region of a nucleic acid repeat sequence which can 
perform a large number of measurements in parallel. 

It is yet an additional object of the present invention to provide a method for detennining 
the number of repeat units making up a repeat region of a nucleic acid repeat sequence which 
does not require an electrophoretic separation. 

It is an object of the present invention to provide kits and reagents useful for practicing a 
method for dete rminin g the number of repeat units making up a repeat region of a nucleic acid 
repeat sequence having the above described characteristics. 

In a first aspect, the foregoing and other objects of the invention are achieved by a 
method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising annealing a primer-complementary portion of a target nucleic acid to a primer 
thereby forming a target-primer hybrid; performing a first primer extension reaction using a first 
primer extension reagent; separating the target-primer hybrid and unreacted first primer 
extension reagent; performing a second primer extension reaction using a second primer 
extension reagent, wherein at least one of the first or second primer extension reagents includes 
an extendible nucleotide having a label attached thereto; separating the target-primer hybrid from 
unreacted second primer extension reagent; measuring a signal produced by the label; treating 



the label so as to render the label undetectable; and repeating the above steps until the signal is 
substantially less than a signal detected in a previous cycle. 

In one preferred embodiment of the first aspect of the invention, the step of performing a 
second primer extension reaction further includes reacting the target-primer hybrid with a primer 
termination reagent. 

In yet another preferred embodiment of the first aspect of the invention, the label is a 
fluorescent or chemiluminescent molecule. 

In another preferred embodiment of the first aspect of the invention, the label is attached 
to the extendible nucleotide through a cleavable linker. 

In an additional preferred embodiment of the first aspect of the invention, the target 
nucleic acid is amplified prior to analysis. Preferably such amplification is achieved using a 
PCR. 

In an another preferred embodiment of the first aspect of the invention, the step of 
treating the label so as to render the label undetectable includes either cleaving the label from the 
labeled extendible nucleotide or destroying a signal producing property of the label. 

In another preferred embodiment of the first aspect of the invention, the target-primer 
hybrid is attached to a solid support. Preferably, one of the primer or the target nucleic acid is 
attached to the solid support. 

In a novel, preferred embodiment employing a solid support, the invention includes a 
method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide sample 
30 under conditions effective for the primers to anneal to primer-complementary regions in one or 
more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 
each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
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having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

(B) performing a first primer extension reaction on the hybrid(s) using a first primer 
extension reagent; 

(C) separating the target-primer hybrid(s) and unreacted first primer extension reagent; 

(D) performing a second primer extension reaction on the hybrid(s) using a second 
primer extension reagent, wherein at least one of the first or second primer extension reagents 
includes an extendible nucleotide having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 
reagent; 

(F) measuring a signal produced by the label; 

(G) treating the label so as to render the label undetectable; and 

(H) repeating a cycle of steps (A) through (G) until the signal detected in the target- 
primer hybrid(s) is substantially less than a signal detected in a previous cycle, 

wherein (I) prior to step (F), at least an aliquot of either (1) the different-sequence 
primers or (2) the tagged sample polynucleotides are contacted with an addressable array of 
immobilized, different tag complements, and each different tag complement contains a 
sequence that is complementary to one of the tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

In one embodiment, the contacting in step (I) is performed prior to step (A). In another 
embodiment, the contacting in step (I) is performed after step (A), and/or before any one of steps 
(B), (C), (D), (E), and (F). In yet another embodiment, steps (A) through (H) are performed on 
at least two replicate arrays, and one of the replicate arrays is subjected to at least one more cycle 
of steps (A) through (G) than is a second replicate array. 

In a second aspect, the foregoing and other objects of the invention are achieved by a 
method for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising annealing a primer-complementary portion of a target nucleic acid to a primer 
thereby forming a target-primer hybrid; performing a first primer extension reaction using a first 
primer-extension reagent; separating the target-primer hybrid from unreacted first primer 
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extension reagent; performing a second primer extension reaction using a second primer 
extension reagent and with a primer termination reagent, the primer termination reagent 
including a nucleotide terminator having a label attached thereto; separating the target-primer 
hybrid from unreacted second primer extension reagent and unreacted primer termination 
5 reagent; measuring a signal produced by the label; and repeating the above steps until a signal is 
detected indicating incorporation of the labeled nucleotide terminator into the primer extension 
product. 

In yet another preferred embodiment of the second aspect of the invention, the label is 
10 selected from the group consisting of fluorescent and chemiluminescent molecules. 

In an additional preferred embodiment of the second aspect of the invention, the target 
nucleic acid is amplified prior to analysis. Preferably such amplification is achieved using a 
PCR. 

15 

In a preferred embodiment of the second aspect of the invention, the target-primer hybrid 
is attached to a solid support. Preferably, one of the primer or the target nucleic acid is attached 
to the solid support. 

20 In a novel, preferred embodiment, the invention includes a method for determining the 

number of repeat units in a repeat region of a target nucleic acid comprising the steps of: 

(A) contacting a plurality of different-sequence primers with a polynucleotide ^sample 
under conditions effective for the primers to anneal to primer-complementary regions in one or 
more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (1) 

25 each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
having a nucleotide sequence that uniquely identifies the attached polynucleotide, 

(B) performing a first primer extension reaction on the hybrid(s) using a first primer- 
30 extension reagent; 

(C) separating the target-primer hybrid(s) from unreacted first primer extension reagent; 
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(D) performing a second primer extension reaction on the hybrid(s) using a second 
primer extension reagent and with a primer termination reagent, the primer termination reagent 
including a nucleotide terminator having a label attached thereto; 

(E) separating the target-primer hybrid(s) from unreacted second primer extension 
reagent and unreacted primer termination reagent; 

(F) measuring a signal produced by the label; and 

(G) repeating a cycle of steps (A) through (F) until a signal is detected indicating 
incorporation of the nucleotide terminator, 

wherein (H) prior to step (F), at least an aliquot of either (1) the different-sequence 
primers or (2) the tagged sample polynucleotides, are contacted with an addressable array of 
immobilized, different tag complements, and each different tag complement contains a 
sequence that is complementary to one of the tag segments, under conditions effective to 
hybridize the tag segments to corresponding tag complements on the support. 

In one embodiment, the contacting in step (H) is performed prior to step (A). In another 
embodiment, the contacting in step (H) is performed after step (A), and/or before any one of 
steps (B), (C), (D), (E), and (F). In yet another embodiment, steps (A) through (G) are 
performed on at least two replicate arrays, and one of the replicate arrays is subjected to at least 
one more cycle of steps (A) through (F) than is a second replicate array. 

In a third aspect, the foregoing and other objects of the invention are achieved by a kit 
useful for determining the number of repeat units in a repeat region of a target nucleic acid 
comprising a primer having a sequence complementary to a primer-complementary portion of a 
target nucleic acid; a first primer extension reagent; and a second primer extension reagent, 
wherein at least one of the first or second primer extension, reagents includes an extendible 
nucleotide having a label attached thereto. 

In a preferred embodiment of the third aspect of the invention, the primer is attached to a 
solid support. 

In an additional preferred embodiment of the second aspect of the invention, the label is 
selected from the group consisting of fluorescent and chemiluminescent molecules. 
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In another preferred embodiment of the second aspect of the invention, the label is 
attached to the extendible nucleotide through a cleavable linker. 

5 These and other objects, features, and advantages of the present invention will become 

better understood with reference to the following description, drawings, and appended claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows a schematic depiction of a target nucleic acid. 

FIGS. 2A-C show a first aspect of the method of the invention wherein an extendible 
nucleotide is labeled and the label is rendered undetectable subsequent to each discrete increment 
of primer extension. 

FIGS. 3A-B show a second aspect of the method of the invention wherein a nucleotide 
terminator is labeled. 

FIGS 4 and 5 show exemplary schemes for practicing the invention using a solid phase 
support containing an array of tag complements for obtaining sequence repeat information for a 
plurality of different samples in parallel, using tagged primers or tagged sample polynucleotides. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Reference will now be made in detail to the preferred embodiments of the invention, 
examples of which are illustrated in the accompanying drawings. While the invention will be 
25 described in conjunction with the preferred embodiments, it will be understood that they are not 
intended-to limit the invention to those embodiments. On the contrary, the invention is intended 
to cover alternatives, modifications, and equivalents, which may be included within the invention 
as defined by the appended claims. 

30 L DEFINITIONS 

Unless stated otherwise, the following terms and phrases as used herein are intended to 
have the following meanings: 
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"Nucleoside" refers to a compound consisting of a purine, deazapurine, or pyrimidine 
nucleoside base, e.g., adenine, guanine, cytosine, uracil, thymine, deazaadenine, deazaguanosine, 
and the like, linked to a pentose at the 1' position, including 2'-deoxy and I'-hydroxyl forms 
(Stryer). 

The term "nucleotide" as used herein refers to a phosphate ester of a nucleoside, e.g., a 
triphosphate ester, wherein the most common site of esterification is the hydroxyl group attached 
at the C-5 position of the pentose. Many times in the present disclosure the term nucleoside will 
be intended to include both nucleosides and nucleotides. The terms nucleotide and nucleoside as 
used herein are intended to include synthetic analogs having modified nucleoside base moieties, 
modified sugar moieties, and/or modified phosphate ester moieties, e.g., as described elsewhere 
(Scheit; Eckstein). 

"Polynucleotide" or "oligonucleotide" refer to linear polymers of nucleotide monomers, 
including single, double and triple stranded deoxyribonucleotides, ribonucleotides, a-anomeric 
forms thereof, and the like. Usually the nucleoside monomers are linked by phosphodiester 
linkages, where as used herein, the term "phosphodiester linkage" refers to phosphodiester bonds 
or bonds including phosphate analogs thereof wherein the phosphorous atom is in the +5 
oxidation state and one or more of the oxygen atoms is replaced with a non-oxygen moiety. 
Exemplary phosphate analogs include phosphorothioate, phosphorodithioate, 
phosphoroselenoate, phosphorodiselenoate, phosphoroanilpthioate, phosphoranilidate, 
phosphoramidate, boronophosphates, and the like, including associated counterions, e.g., H + , 
NH 4 + , Na + , and the like if such counterions are present. Alternatively, polynucleotides may 

comprise polymers of non-nucleotidic monomers, linked through phosphodiester linkages or 
other linkages, which are capable of forming sequence-specific hybrids with a target nucleic 
acid, e.g., peptide nucleic acid polymers (PNAs, e.g., see Knudsen, 1996). Polynucleotides 
typically range in size from a few monomelic units, e.g. 8-40, to several thousands of 
monomelic units. Whenever a polynucleotide is represented by a sequence of letters, such as 
"ATGCCTG," it will be understood that the nucleotides are in 5'->3' order from left to right and 
that "A" denotes deoxyadenosine, "C" denotes deoxycytidine, "G M denotes deoxyguanosine, and 
"T" denotes thymidine, unless otherwise noted. 
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"Extendible nucleotide" means any nucleotide that when incorporated into a primer 
extension product during a primer extension reaction allows for the further extension of such 
primer extension product. Exemplary extendible nucleotides include 2'-deoxynucleotide 
5 triphosphates, e.g., 2'-deoxyuridine-5' -triphosphate, 2 , -deoxyguanosine-5' -triphosphate, 2'- 
deoxy-7-deazadeoxy guanosine-5 ' -triphosphate, 2 ' -deoxy adenosine-5 ' -triphosphate, . 2 ' - 
deoxythymidine-5' -triphosphate, and 2 ' -deoxy cytidine-5' -triphosphate. Optionally, one or more 
of the extendible nucleotides includes a label. 

10 "Nucleotide terminator" means any nucleotide that when incorporated into a primer 

extension product prevents the further extension of such primer extension product. One 
requirement of a nucleotide terminator is that when the nucleotide terminator includes a 
ribofuranose sugar portion, the 3' -position must not have a hydroxy group capable of being 
subsequently used by a polymerase to incorporate additional nucleotides. Alternatively, a 

15 ribofuranose analog could be used, such as arabinose. Exemplary nucleotide terminators include 
2\3'-dideoxy-p-D-ribofuranosyl, p-D-arabinofuranosyl, 3 '-deoxy-p-D-arabinofuranosyl, 3 
amino-2',3'-dideoxy-P-D-ribofuranosyl, and 2',3 , -dideoxy-3'-fluoro-p-D-ribofuranosyl 
(Chidgeavadze, 1984, 1985). Nucleotide terminators also include reversible nucleotide 
terminators (Metzker), and 3' -deoxy substituents such as hydrogen, 3'-fluoro, 3' -amino, and 3'- 

20 azido, for example (Mikhailopulo et al., 1989; Krayevski et al., 1984; Chidgeavadze, 1986). 

"Polymerase" means an enzyme or other catalyst capable of catalyzing a reaction leading 
to a target-sequence dependent incorporation of a nucleotide onto a 3' -end of a primer or primer 
extension product when such primer or primer extension product is annealed to a target nucleic 
25 acid. Exemplary polymerases include but are not limited to Pfu DNA polymerase, E.Coli 
Polymerase I, 7*- 7 polymerase, reverse transcriptase, Taq DNA polymerase, and the like 
(Romberg and Baker). 

"Label" means any moiety that, when attached to a nucleotide or polynucleotide of the 
30 invention, render such nucleotide or polynucleotide detectable using known detection means. 
Labels may be direct labels which themselves are detectable or indirect labels which are 
detectable in combination with other agents. Exemplary direct labels include but are not limited 
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to fluorophores, chromophores, radioisotopes, spin-labels, chemiluminescent labels, and the like. 
Exemplary indirect labels include enzymes which catalyze a signal-producing event, and ligands 
such as an antigen or biotin which can bind specifically with high affinity to a detectable anti- 
ligand, such as a labeled antibody or avidin. 

"Primer extension reaction" means a reaction between a target-primer hybrid and a 
nucleotide which results in the addition of the nucleotide to an end of the primer, usually the 3'- 
end, such that the added nucleotide is complementary to the corresponding nucleotide of the 
target nucleic acid. 

"Primer-extension reagent" means a reagent including components necessary to effect a 
primer extension reaction. Primer extension reagents typically include (i) a polymerase enzyme; 
(ii) a buffer; and (iii) one or more extendible nucleotides. 

"Specific binding pair" refers to a pair of molecules that specifically bind to one another 
to form a binding complex. Examples of specific binding pairs include, but are not limited to 
antibody-antigen (or hapten) pairs, ligand-receptor pairs, enzyme-substrate pairs, biotin-avidin 
pairs, polynucleotides having complementary base pairs, and the like. 

"Primer" is a polynucleotide capable of selectively annealing to a specified target 
sequence and thereafter serve as a point of initiation of a primer extension reaction wherein the 
primer is extended in the 3' -* 5' a 5' -^3' direction, typically the latter. 

II. MATERIALS USED IN THE METHOD OF THE INVENTION 

A. Target Nucleic Acid 

The target nucleic acids for use with the invention may be derived from any living or 
once living organisms, including but not limited to prokaryotes, eukaryotes, plants, animals, 
and viruses, as well as synthetic nucleic acids. The target nucleic acids may originate from any . 
of a wide variety of sample types, such as cell nuclei (e.g., genomic DNA) and extranuclear 
nucleic acids, e.g., plasmids, mitrochondrial nucleic acids, and the like. The target nucleic 
acids can include DNA or RNA, and are usually DNA. 
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Many methods are available for the isolation and purification of a target nucleic acid for 
use in the present invention. The preferred purification method should provide target nucleic 
acid sufficiently free of protein to .allow efficient primer extension and nucleic acid 
amplification. Preferred purification methods include (i) organic extraction followed by ethanol 
precipitation, e.g., using a phenol/chloroform organic reagent (Ausubel), preferably using an 
automated DNA extractor, e.g., the Model 341 DNA Extractor available from PE Applied 
Biosystems (Foster City, CA); (ii) solid phase adsorption methods (Walsh, Boom); and (iii) 
salt-induced DNA precipitation methods (Miller), such methods being typically referred to as 
"salting-out" methods. Optimally, each of the above purification methods is preceded by an 
enzyme digestion step to help eliminate protein from the sample, e.g., digestion with proteinase 
K, or other like proteases. 

To increase sensitivity, preferably the target nucleic acid is amplified prior to performing 
the method using a suitable nucleic acid amplification procedure. Such amplification may be 
linear or exponential. In a preferred embodiment, amplification of the target nucleic acid is 
accomplished using the polymerase chain reaction (PCR) (Mullis). Generally, the PCR consists 
of an initial denaturation step which separates the strands of a double stranded nucleic acid 
sample, followed by the repetition of (i) an annealing step which allows amplification primers to 
anneal specifically to positions flanking a target sequence; (ii) an extension step which extends 
the primers in a 5'— » 3' direction thereby forming an amplicon nucleic acid complementary to 
the target sequence, and (iii) a denaturation step which causes the separation of the amplicon and 
the target sequence. Each of the above steps may be conducted at a different temperature, where 
the temperature changes may be accomplished using a thermocycler (PE Applied Biosystems, 
Foster City, CA). 

The generalized structure of a target nucleic acid for use in the present invention is shown 
in FIG. 1 where the target nucleic acid 5 includes a 5 '-flanking portion 10 including a primer 
complementary portion 15, a 3 '-flanking portion 25, and a repeat region 20 located between the 
5 '-flanking portion and the and the 3 '-flanking portion. The repeat region 20 of the target 
nucleic acid comprises multiple repeat units (R) n 21 where R indicates a repeat unit and n 
designates the number of repeat units making up the repeat region. The repeat unit R may be any 
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type of repeat motif, for example, but not limited to a microsatellite repeat (Webber and May; 
Smeets; Williamson), a minisatellite repeat (Jeffreys, Caskey), or an cc-satellite repeat (Jabs). 

The repeat region may be made up of multiple types of repeat units or repeat units which 
5 are themselves polymorphic. 

B. Primer 

Primers for use in the present invention are designed to obtain a balance between 
specificity of primer annealing, i.e., the frequency with which an undesired target sequence 
10 participates in a primer extension reaction, and efficiency of primer extension,, i.e., the extent to 
which a desired target sequence participates in the primer extension reaction. 

Specificity of primer annealing is generally controlled by the length of the primer and the 
temperature of the annealing reaction. Polynucleotides between about 18 and 24 bases are 
15 preferred because such polynucleotides tend to be very sequence specific when the annealing 
temperature is set within a few degrees of a primer melting temperature (Dieffenbach). To 
facilitate primer extension, a 3 '-end of the primer includes an — OH group or other moiety which 
allows incorporation of a nucleotide onto the 3 '-end of the primer. There exist a number of 
computer programs to facilitate primer selection in different contexts (Osborne; Montpetit). 

20 

In a preferred embodiment, the sequence of the primer is selected such that the primer 
anneals to the primer complementary portion of the 5 '-flanking portion of the target nucleic 
acid. Preferably, the primer anneals such that a 3 '-end of the primer is adjacent to a 5 '-end of a 
repeat region of a target nucleic acid. However, the primer may also anneal to a segment of the 
25 repeat region of the target nucleic acid so long as it is at least partially anchored to the 5'- 
flanking portion of the target. 

For embodiments of the invention which employ sample identifier tags and arrays of tag 
complements, the invention utilizes a plurality of extendable, different-sequence primers for 
30 detecting target sequences of interests. In one embodiment, the tagged primer includes a target 
binding segment, a tag segment, and an extendable primer end (5' or 3'). The target binding 
segment includes a polynucleotide sequence which is selected to bind to a selected target 
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sequence. The tag segment contains a unique polynucleotide sequence that allows identification 
of the target binding segment to which the tag segment is attached. The tag segment can be 
directly attached to the distal end of the target binding segment, or is optionally linked to the 
tag segment by an intervening spacer group. In another embodiment, the tag segment is linked 
5 to an internal site within the target binding segment. Thus, the tag can be linked to an 
intersubunit linking group, or to a nucleotide base, within the target binding segment. 
Preferably, the tag is attached to an end of the target binding segment that is distal with respect 
to the extendable end of the primer. 

O 10 The sequence of each target binding segment is selected to hybridize to a selected 

uj complementary target which contains a potential polymorphism or mutation, preferably such 

m that the 3 '-end of the primer is adjacent to a 5'-end of a repeat region of a target nucleic acid (for 

ir extension in the 5' to 3' direction). However, the primer may also anneal to a segment of the 

s repeat region of the target nucleic acid so long as it is at least partially anchored to the 5'- 

q 15 flanking portion of the target. 

The length of the target binding segment in each tagged primer is selected to ensure 
□ specific hybridization of the primer to the desired target, without significant cross- 

hybridization to non-target nucleic acids in the sample. Also, to enhance primer specificity, it 
is preferred that the melting temperatures of the target binding segments are within a few 
20 degrees of each other. Preferably, the melting temperatures of the target binding segments fall 
within a ATm range (Tmax - Tmin) of 10°C or less, and preferably 5°C or less. This can be 
accomplished by suitable choice of binding segment lengths based on known methods for 
predicting primer melting temperatures (Breslauer, 1986; Rychlik, 1989 and 1990; Wetmur, 
1991; Osborne, 1991; Montpetit, 1992) for example. As above, target binding segments 
25 between about 18 and 24 bases in length are preferred. 

The tag segment in each tagged primer is designed to contain a sequence that uniquely 
identifies the attached target binding segment. Thus, the tag sequences should be selected to 
minimize (1) internal, self-hybridization, (2) hybridization with other same-sequence tags, (3) 
30 hybridization with other, different sequence tag complements, (4) and hybridization with the 
sample polynucleotides. Also, it is preferred that each tag can specifically recognize and 
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hybridize to its corresponding tag complement under the same conditions for all tags in the 
primers. 

Tag sequences can be selected by any suitable method. For example, computer 
algorithms for selected non-crosshybridizing sets of tags are described in Brenner (1996) and 
Shoemaker (1997). Preferably, the tag sequences have strands that are within a preselected 
temperature range, as discussed above with respect to the extendable primers. Preferably, the 
melting temperatures of the target binding segments fall within a ATm range (Tmax - Tmin) of 
10°C or less, and preferably within 5°C or less, as calculated using any of the methods above 
(e.g., Breslauer). Preferably, the tag segments are at least 12 bases in length to facilitate 
specific hybridization to corresponding tag complements.* Typically, tag segments are from 12 
to 60 bases in length, and typically from 15 to 30 bases in length. 

Tags and tag complements may be single or double stranded, such that sequence 
specific hybridization forms either duplexes by Watson and Crick base-pairing, or triplexes by 
forward or reverse Hoogsteen bonding. In embodiments where specific hybridization occurs 
via triplex formation, coding of tag sequences follows the same principles as for duplex- 
forming tags; however, there are further constraints on the selection of word sequences. 
Generally, third strand association via Hoogsteen type of binding is most stable along 
homopyrimidine-homopurine tracks in a double stranded target. Usually, base triplets form in 
T-A*T or C-G*C motifs (where "-" indicates Watson-Crick pairing and indicates 
Hoogsteen type of binding); however, other motifs are also possible. For example, Hoogsteen 
base pairing permits parallel and antiparallel orientations between the third strand (the 
Hoogsteen strand) and the purine-rich strand of the duplex to which the third strand binds, 
depending on conditions and the composition of the strands. 

There is extensive guidance in the literature for selecting appropriate sequences, 
orientation, conditions, nucleoside type (e.g. whether ribose or deoxyribose nucleosides are 
employed), base modifications (e.g. methylated cytosine, and the like in order to maximize, or 
otherwise regulate, triplex stability as desired in particular embodiments, e.g., Brenner 
(supra). More generally conditions for annealing single-stranded or duplex tags to single- 
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stranded or duplex sequence complements are well known, e.g. Brenner (supra), Ji et al. 
(1993), Cantor et al. (supra), Wetmur (1991), Breslauer et al. (1986), Schena (1995), and the 
like. 

5 Preferably, polynucleotides such as primers are synthesized conventionally on an 

automated DNA synthesizer, e.g., PE Applied Biosystems (Foster City, CA) model 392 or 394 
DNA/RNA Synthesizer, using standard chemistries, e.g., phosphoramidite chemistry (Beaucage). 
In an alternative method, primers can be isolated from a biological source. 

10 C. Solid Phase Supports 

In a- preferred embodiment of the method of the invention, a target-primer hybrid is 
attached to a solid phase support during a separating step. Such attachment may be through 
either the target nucleic acid or the primer polynucleotide. 

15 Solid phase supports for use with the invention may have a wide variety of forms, 

including microparticles, beads, membranes, slides, plates, micromachined chips, and the like. 
In addition, solid phase supports of the invention may comprise a wide variety of compositions, 
including glass, plastic, silicon, alkanethiolate-derivatized gold, GaAs, copper, germanium, 
cellulose, low cross-linked and high cross-linked polystyrene, crosslinked polyacrylamide 

20 matrices, silica gel, polyamide, membranes such as nylon, polyvinylidine difluoride (PVDF), or 
poly-tetrafluoroethylene, and the like. 

Where attachment of the target-primer hybrid is through the primer, primers may be used 
with a solid phase support on which they were synthesized, or they may be separately 
25 synthesized and attached to a solid phase support for use during or before the separation step of 
the method. 

When primers are synthesized on and used with the same solid phase support, such 
support may comprise a variety of forms and include a variety of linking moieties. Such 
30 supports may comprise microparticles or planar arrays, or matrices of regions having 
substantially uniform populations of primers. A wide variety of microparticle synthesis supports 
may be used with the invention, including microparticles made of controlled pore glass (CPG), 
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highly cross-linked polystyrene, acrylic copolymers, cellulose, nylon, dextran, latex, 
polyacrolein, and the like. Microparticle supports further include commercially available 
nucleoside-derivatized CPG and polystyrene beads (e.g. available from Applied Biosystems, 
Foster City, CA); derivatized magnetic beads; polystyrene grafted with polyethylene glycol (e.g., 
TentaGel, Rapp Polymere, Tubingen Germany); and the like. Selection of the support 
characteristics, such as material, porosity, size, shape, and the like, and the type of linking 
moiety employed depends on the conditions under which the primers are used. For example, in 
the present invention, supports and linkers that niinimize steric hindrance of the polymerase 
enzymes and that facilitate access to nucleotide substrate are preferred. Other important factors 
to be considered in selecting the most appropriate microparticle support include size uniformity, 
efficiency as a synthesis support, degree to which surface area known, and optical properties, 
e.g., clear smooth beads provide instrumentational advantages when handling large numbers of 
beads on a surface. 

As mentioned above, primers may also be synthesized on a single (or a few) solid phase 
j supports to form an array of regions uniformly coated with primers. That is, within each region 

in such an array the same primer is synthesized. Techniques for synthesizing such arrays are 
disclosed elsewhere (Pease; Southern). 

20 When primers are separately synthesized, and subsequently attached to a solid phase 

support for use, the primer may be attached to the support through a covalent linkage or a non- 
... covalent linkage. When the primer is attached to the solid support through a non-covalent 
linkage, the primer includes one member of specific binding pair, e.g., biotin, the other member 
of the pair being attached to the solid support, e.g., avidin. Several methods are available for 

25 covalently linking polynucleotides to solid supports, e.g., through reaction of a 5'-amino 
polynucleotide with an isothiocyanate-functionalized glass support (Guo). A wide range of 
exemplary linking moieties for attaching primers onto solid supports either covalently or non- 
covalently are disclosed elsewhere. (Pon; Webb; Barany;- -Damha; Beattie; Maskos and 
Southern). 
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Where attachment of the primer-template hybrid is through the template nucleic acid, and 
the template nucleic acid is a PCR amplicon, the means for covalent or non-covalent attachment 
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may be incorporated into a PCR primer used to effect the PCR. Thus, the PCR primer may 
contain a member of a specific binding pair, e.g., biotin, or a reactive moiety which can react 
with a functionalized solid support to. form a covalent linkage, e.g., a 5 '-amino group which 
reacts with a an isothiocyanate-functionalized glass support. 

As noted above, the invention also utilizes a set of tag complements which are 
complementary to corresponding tag sequences in the tagged primers. The tag complements 
are provided as an addressable array, according to the design choice of the user. By 
"addressable array" is meant that the sequence of the target binding segment of each primer is 
known or can be determined from the position of hybridization of each primer on the array. 
Preferably, the tag complements are immobilized in discrete regions on a planar surface, such 
that each discrete region contains only tag complements having a particular sequence, and such 
that the sequence of the tag complement at each different discrete region is known. 
Conveniently, the tag complements are distributed as a periodic two-dimensional array of 
discrete tag complement regions which can be indexed via X and Y coordinates, or any 
equivalent thereof. Tag complements can be attached to appropriate solid phase support 
materials following the same considerations as for attachment of primers, discussed above. 

To reduce the amounts of assay reagents used for tag detection, and to facilitate the 
sequencing of large numbers of fragment sequences, the arrays are preferably formed as 
microarrays having tag complement region densities of greater than 100 regions/cm 2 , 300 
regions/cm 2 , 10 3 regions/cm 2 , 3 x 10 3 regions/cm 2 , 10 4 regions/cm 2 , 10 5 regions/cm 2 , or 10 6 
regions/cm 2 . In addition, the number of different sequence tag complements in each array is 
preferably equal to or greater than 10, 20, 50, 100, 200, 500, 1000, 3000, 10,000, 30,000, 
100,000, or 300,000. 

D. Labeled Nucleotides 

In the methods of the present invention, one or more extendible nucleotides and/or 
nucleotide terminators include a label. The label is attached to the nucleotide in such a way that 
the label does not substantially interfere with polymerase-mediated incorporation of the labeled 
nucleotide in a primer extension reaction. Many alternative methods are available for labeling 
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nucleotides in a manner suitable for use with the present invention (Kxicka). In a preferred class 
of labeling methods, a nucleoside base of the nucleotide is modified to include a label, i.e., the 
N-6 position of a purine base or the C-5 position of a pyrimidine base. A particularly preferred 
class of labeled nucleotides are the propargylethoxyamino nucleotides (Khan). 

In one preferred embodiment of the invention, a labeled extendible nucleotide is capable 
of being rendered undetectable, e.g., upon treatment with a suitable reagent or electromagnetic 
radiation. In this embodiment, the labeled extendible nucleotide may be rendered undetectable 
by either removing the label from the nucleotide or by destroying the signal-producing 
properties of the label. 

Several methods are available for attaching a label to an extendible nucleotide such that 
the label may be easily removed. Exemplary cleavable linkers for linking a label to a nucleotide 
include but are not limited to (N-[4-(p-azidosalicylamido)butyl]-3'-[2'- 
pyridyldithiojpropionamide (APDP), (>is[2-(succinimidooxycarbonyloxyl)ethyl]siilfone 
(BSOCOES), disuccininimdyl tartarate (DST), and ethylene glycobis-[succinimidylsuccinate] 
(EGS), and the like (Pierce Catalog). 

Preferred labels whose signal producing properties may be destroyed upon treatment with 
a suitable reagent or electromagnetic radiation include fluorescent labels whose fluorescent 
properties may be destroyed by photodestruction through exposure to high intensity light or by 
chemical degradation through treatment with oxidiziiig chemicals, e.g., oxygen,, sodium 
hypochlorite, permanginate and the like. Alternative preferred classes of labels include 
enzymes, which may be rendered undetectable by reaction with an irreversable enzyme inhibitor 
or by denaturation, and chemiluminescent labels which can undergo only a single light- 
producing transformation and are thus autodestructing. 

E. First and Second Primer-Extension Reagents 

The present invention includes first and second primer extension reagents that, when used 
according to methods of the invention, enable the extension of a primer to proceed in discrete 
increments of single repeat units, i.e., the primer is extended only by one repeat unit per discrete 
primer extension reaction cycle. 



The first primer extension reagent of the invention includes a set of extendible 
nucleotides which allow a primer extension reaction to proceed only to the extent that a primer is 
extended by an amount less than a full repeat unit. Thus, depending on the particular sequence 
5 of the repeat unit, the first primer extension reagent may include a variety of possible extendible 
nucleotide combinations. For example, if the sequence of the repeat unit is AGCT, the first 
primer extension reagent could include extendible nucleotides T (complementary to A), T and C 
(complementary to A and G), or T and C and G (complementary to A and G and C). However, 
to prevent uncontrolled continuous primer extension, the first primer extension reagent may not 
10 contain all extendible nucleotides T and C and G and A. 

In certain situations, it is desirable to sub-divide the first primer extension reagent into 
separate sub-reagents such that each sub-reagent includes extendible nucleotides sufficient to 
allow extension of a primer only to the extent that a sub-portion of the repeat sequence is formed. 
15 For example, if the repeat unit is ATGCCGT, one sub reagent of the first primer extension 
reagent could include extendible nucleotides T, A, and C, while another sub-reagent of the first 
primer extension reagent could include extendible nucleotides G, and C. 

The second primer extension reagent of the invention includes a set of extendible 
20 nucleotides which allow a primer extension reaction to proceed only to the extent that the portion 
of a repeat unit not synthesized by the first primer extension reagent is synthesized. Thus, 
depending on the particular sequence of the repeat unit and the composition of the first primer 
reagent, the second primer extension reagent may include a variety of possible nucleotide 
combinations. Continuing the example discussed above, if the sequence of the repeat unit is 
25 AGCT and the first primer extension reagent includes extendible nucleotides T and C, the second 
primer extension reagent may include extendible nucleotides G and A. 

F. Primer Termination Reagent 

The present invention includes a primer termination reagent for causing the termination 
30 of primer extension such that once a primer extension product has reacted with the primer 
termination reagent, no further primer extension may be accomplished. 
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The primer termination reagent of the invention includes one or more nucleotide 
terminators, and optionally, a set of extendible nucleotides which allow a primer extension 
reaction to proceed only to the extent that a primer is extended by an amount less than a full 
repeat unit in a primer extension reaction. Thus, depending on the particular sequence of the 
5 repeat unit, the sequence of the 3' -flanking portion of the target nucleic acid, and the 
composition of the first and second primer extension reagents, the primer termination reagent 
may include a variety of possible extendible nucleotide and nucleotide terminator combinations. 
For example, if the sequence of the repeat unit is AGCT, and the first nucleotide of the 3'- 
flanking portion is G, and the first and second primer extension reagents include the extendible 
S 10 nucleotides T, C, G and A, the primer termination reagent would include only the nucleotide 
W terminator C, such nucleotide terminator being complementary to the G located in the 3'- 

m flanking portion. Alternatively, if the first and second primer extension reagents, only include 

Of the extendible nucleotides T and C, the primer termination reagent would include extendible 

s nucleotides G and A and nucleotide terminator C. 

S 15 

W In certain aspects of the present invention, the primer termination reagent includes a 

\ Li 

O labeled nucleotide terminator. The labeling of the nucleotide terminator is accomplished 

essentially as described above in Section D. 

20 III. THE METHOD 

A preferred embodiment of a first aspect of the method of the invention is schematically 
depicted in FIGS. 2A-C. In the figure, the method is applied to a target nucleic acid 5 having a . 
repeat region 20 made up of two copies of a two-base repeat having the sequence "AC" and a 3'- 
flanking portion 25 having a G nucleotide abutting the repeat region. In this preferred 
25 embodiment of the first aspect, a primer 200 is annealed to a primer-complementary portion 15 
of the target nucleic acid 5 thereby forming a target-primer hybrid. The target-primer hybrid is 
then reacted with a first primer-extension reagent including a labeled extendible nucleotide T, 
resulting in the incorporation of the labeled T nucleotide into the 3 '-end of a primer extension 
product 210. Following reaction with the first primer extension reagent, the first primer 
30 extension reagent is separated from the target-primer hybrid and the target-primer hybrid is 
reacted with a second primer-extension reagent including an extendible G nucleotide, resulting in 
the addition of the G nucleotide into a 3'-end of the primer extension product 215. Next the 
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unreacted second primer extension reagent is separated from the target-primer hybrid and a 
measurement is performed to determine the amount of labeled extendible nucleotide 
incorporated into the, primer extension product. As indicated by the histogram in the figure, at 
this point in the process, a large signal is detected, indicating the presence of the incorporated 
5 labeled T nucleotide. Finally, in order to prepare the target-primer hybrid for a subsequent 
discrete primer extension reaction cycle, the label attached to the incorporated extendible 
nucleotide is rendered undetectable. In the example the above described process steps are 
repeated two more times as shown in FIGS. 2 B and 2C. In the third cycle shown in FIG. 2C, the 
intensity of the measured signal is substantially reduced as compared to the signal intensities 
10 seen in the first two cycles because the labeled extendible nucleotide T can not be incorporated 
into the primer extension product at this point. Thus, this reduction in the measured signal seen 
in the third cycle indicates that the repeat region only contains two copies of the AC repeat unit. 

A preferred embodiment of a second aspect of the method of the invention is shown in 

15 FIGS. 3A-B. As before, the method is applied to a target nucleic acid having a repeat region 
made up of two copies of a two-base repeat having the sequence "AC" and a 3' -flanking portion 
having a G nucleotide abutting the repeat region. In this preferred embodiment of the second 
aspect, as before, a primer 200 is annealed to a primer-complementary portion 15 of a target 
nucleic acid 5 thereby forming a target-primer hybrid. The target-primer hybrid is then reacted 

20 with a first primer-extension reagent including an unlabeled extendible nucleotide T, resulting in 
the incorporation of the unlabeled T nucleotide into the 3 '-end of a primer extension product 
310. Following reaction with the first primer extension reagent, the first primer extension reagent 
is separated from the target-primer hybrid and the target-primer hybrid is reacted with a second 
primer-extension reagent, including an extendible G nucleotide, and a primer termination reagent 

25 including a labeled C nucleotide terminator, resulting in the addition of only the G extendible 
nucleotide into the 3 '-end of the primer extension product 315. Next the unreacted second primer 
extension reagent and primer termination reagent are separated from the target-primer hybrid and 
a measurement is performed to determine the amount of labeled nucleotide terminator 
incorporated into the primer extension product. As indicated in the figure, at this point in the 

30 process, no signal is detected, indicating that the labeled nucleotide terminator is not 
incorporated into the primer extension product during this cycle of the process. In FIG. 3B, the 
above described process step is repeated. In this second cycle shown in FIG. 3B, the intensity of 
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the measured signal is substantially increased as compared to the nominally zero signal intensity 
seen in the first step of the process because during the second step, the labeled nucleotide C 
terminator is incorporated into the primer extension product. 

5 The following discussion provides a more detailed description of the above-described 

method steps of the first and second aspects of the invention. 

A. Primer Annealing 

The annealing reaction is performed under conditions which are stringent enough to 
O 10 guarantee sequence specificity yet sufficiently permissive to allow formation of stable hybrids at 
•1 an acceptable rate. The temperature and length of time required for primer annealing depend 

W upon several factors including the base composition, length and concentration of the primer, and 

nj the nature of the solvent used, e.g., the concentration of cosolvents such as DMSO, fonnamide, 

or glycerol, and counter ions such as magnesium. Typically, hybridization with synthetic 
L? 15 polynucleotides is carried out at a temperature that is approximately 5 to 10°C below the melting 
ru temperature of the target-primer hybrid in the annealing solvent. Preferably, the annealing 

ru 

□ temperature is in the range of 55 to 75°C and the primer concentration is approximately 0.2 jiM. 

Under these preferred conditions, the annealing reaction will be complete in only a few seconds. 

20 B. Primer Extension Reaction 

The time required to effect a primer extension reaction depends upon the length and 
concentration of the target sequence and upon the reaction temperature. Estimates for the rate of 
nucleotide addition under typical conditions vary from 35 to 100 nucleotides per second 
depending upon the buffer, pH, salt concentration, polymerase enzyme, and the nature of the 

25 DNA template. 

In order to achieve a primer extension reaction which proceeds in discrete increments of 
single repeat units, according to the method of the invention, the primer extension reaction is 
divided into two independent steps: a first primer extension reaction and a second primer 
30 extension reaction. In the first primer extension reaction, the primer is extend by an amount less 
than the length of a single repeat unit, where control of the extent of primer extension is effected 
by the composition of a first primer extension reagent. In the second primer extension reaction, 
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the primer is extended by an amount such that, in combination with the first primer extension 
reaction, the primer is extended by an amount equal to the length of a single repeat unit. 

According to the first aspect of the method of the invention, one of the first or second 
5 primer extension reagents includes a labeled extendible nucleotide. 

Also according to the first aspect of the method of the invention, in a variant of the 
above-described two-step discrete primer extension reaction, the second primer extension 
reagent includes a primer termination reagent. Thus, if after a second primer extension reaction 

10 the primer has been extended to the end of the repeat region of the target nucleic acid, a 
nucleotide terminator will be incorporated into the primer extension product thus prohibiting any 
further extension of that primer extension product. This is advantageous because it will remove 
the possibility that any spurious primer extension beyond the repeat region of the target nucleic 
acid will take place. This embodiment of the invention is particularly preferred where two 

15 alleles of a repeat sequence are being investigated simultaneously. 

According to the second aspect of the invention, a primer termination reagent including a 
labeled nucleotide terminator is included in the second primer extension reaction. Preferably, the 
labeled nucleotide terminator is selected to be complementary to the nucleotide at the 3 '-end of 
20 the 3'-flanking portion of the target nucleic acid which abuts the repeat region of such target 
nucleic acid. 

C. Separation of Primer and Primer Extension Reagents 

Between the first and second primer extension reactions, a separation step is performed to 
25 prevent the mixing of first and second primer extension reagents and thereby prevent 
uncontrolled primer extension. The means used to effect the separation step may be any means 
capable of separating the target-primer hybrid from the first and/or second primer extension 
reagents. Exemplary separation methods include but are not limited to HPLC, electrophoresis, 
liquid-liquid extraction, solid-liquid extraction, adsorption, and the like. 

30 

In a preferred embodiment, the target-primer hybrid is attached to a solid support during 
the separation step such that the primer extension reagents may be separated from the target- 
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primer hybrid simply by washing the solid support. According to this embodiment, the primer 
may be attached to the solid support before or after performing the first primer extension 
reaction. The washing conditions are such that the target-primer hybrid is not substantially 
disrupted and nonspecific adsorption of the primer extension reagents is minimized. 

5 

D. Measuring a Signal 

Subsequent to either the first or second primer extension reaction, a detection step is 
performed wherein the amount of intact label which has been incorporated into a primer 
extension product is determined. Any detection method may be used which is suitable to the 
E 10 type of label employed. Thus, possible detection methods include radioactive detection, optical 
□ absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., 

5 - 2 

□3 fluorescence or chemiluminescence. 

: r ; 
^ ; 

0 The measuring step can take place at various points in the process depending upon the 
^15 aspect of the invention being practiced and the composition of. the first and second primer 

y extension reagents. In the first aspect, preferably the measuring step takes place after the primer 

1 u 

fU extension reagent including the labeled extendible nucleotide has been reacted with and 

i_x separated from the target-primer hybrid. In the second aspect, the measuring step takes place 

after the primer termination reagent including the labeled nucleotide terminator has been reacted 
20 with and separated from the target-primer hybrid 

If the target-primer hybrid(s) are immobilized on a solid support for analysis, extended 
primers can be detected in an addressable array by scanning all or portions of each array 
simultaneously or serially, depending on the scanning method used. For fluorescence labeling, 

25 selected regions on an array may be serially scanned one-by-one or row-by-row using a 
fluorescence microscope apparatus, such as described in Fodor (1995) and Mathies et al. 
(1992). Hybridization patterns may also be scanned using a CCD camera (e.g., Model 
TE/CCD512SF, Princeton Instruments, Trenton, NJ) with suitable optics (Ploem, 1993), such 
as described in Yershov et al. (1996), or may be imaged by TV monitoring (Khrapko, 1991). 

30 For radioactive signals (e.g., 32 P), a phosphorimager device can be used (Johnston et al., 1990; 

Drmanac et al., 1992; 1993). Other commercial suppliers of imaging instruments include 

General Scanning Inc., (Watertown, MA, www.genscan.com), Genix Technologies (Waterloo, 
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Ontario, Canada; www.confocal.com), and Applied Precision Inc. Such detection methods are 
particularly useful to achieve simultaneous scanning of multiple tag complement regions. 

5 E. Rendering A Label Undetectable 

According to the first aspect of the method of the invention, once a signal from a label is 
detected, prior to performing a subsequent cycle of discrete primer extension, the label is 
rendered undetectable. 

jr 10 In one preferred embodiment, the label is rendered undetectable by cleaving the label off 

0 of the labeled extendible nucleotide incorporated in the primer extension product. The method 

1 : I 

m used for cleaving the label from the nucleotide depends on the type of cleavable linkage used to 
jkl link the label and the nucleotide. See above. Exemplary cleavage reagents include thiol, base, 
O periodate, hydroxylamine and the like. In one preferred method, the label is attached to a base- 
ly 15 portion of a uracil nucleotide and subsequent to detection the label is cleaved off of the labeled 

2 extendible nucleotide by treatment with the enzyme uracil N-glycosylase (UNG). 

H In a second preferred embodiment, the label is rendered undetectable by destroying the 

signal-producing properties of the label itself. Depending on the type of label used, there are 

20 several methods which may be employed for destroying the signal-producing properties of the 
label. For example, if the label is a fluorescent dye, the label may be rendered undetectable by 
photobleaching the dye using an intense light source in an oxygen-rich environment. If the label 
is an enzyme, the label may be rendered undetectable by reacting the label with an irreversible 
enzyme inhibitor which renders the enzyme incapable of catalyzing a signal producing reaction. 

25 If the label is a chemiluminescent agent which can undergo only a single light-producing 
transformation, the label is autodistructing and thus does not require a separate reaction to render 
the label undetectable (Bronstein). 

F. Tag Co m plement Arrays 
30 As noted above, the invention also includes embodiments that utilize primers or sample 

polynucleotides that contain tag sequences that uniquely identify the attached primers or sample 
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polynucleotides, which can be immobilized on addressable arrays for analysis of multiple sample 
polynucleotides in parallel. 



A plurality of different-sequence primers are contacted with a polynucleotide sample O 
5 under conditions effective for the primers to anneal to primer-complementary regions in one oxo ^ 
more target polynucleotides, to form one or more target-primer hybrid(s), wherein either (jj- 
each different-sequence primer contains (i) a target binding segment and (ii) a tag segment 
having a nucleotide sequence that uniquely identifies the target binding segment, or (2) one or 
more polynucleotides in the sample are tagged polynucleotides that contain a tag segment 
□ 10 having a nucleotide sequence that uniquely identifies the attached polynucleotide. One or more 
~1 primer extension cycles are performed as described above, and the appearance or loss of signal 

5 is determined at the appropriate times to measure repeat lengths. 



Depending on the preferences of the user, the tagged primers or tagged sample 
15 polynucleotides can be contacted with an addressable array of immobilized, different-sequence 
tag complements which each contain a sequence that is complementary to one of the tag 
segments, under conditions effective to hybridize the tag segments to corresponding tag 
complements on the support. By way of illustration, embodiments using either tagged primers 
or tagged sample polynucleotides are shown in Figs. 4 and 5, respectively, after the tag 
20 segments have been hybridized to a solid phase support, and a target-primer hybrid has been 
formed to yield tertiary polynucleotide complexes. 

Fig. 4 shows a tertiary complex wherein a tag complement oligonucleotide 410 
immobilized on a solid phase support 400 is specifically hybridized to a tagged primer 420. 

25 Tagged primer 420 contains a target binding segment 422 and a tag segment 424. The vertical 
lines between the tag complement oligonucleotide 410 and tag segment 424 indicate 
complementary base-pairing. Also shown is a sample polynucleotide 440 comprising a 
sequence 442 which hybridizes to target binding segment 422, and which, in the embodiment 
shown, terminates at a nucleotide immediately adjacent to a sample repeat region 444. Sample 

30 polynucleotide 440 optionally includes flanking sequences 446 and 448 which do not hybridize 
to the tagged primer. Once polynucleotide 440 and tagged primer-420 have formed a target- 
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primer hybrid, the hybrid can be treated as discussed in preceding sections to extend the 
extendable end of the primer into the repeat region of the sample polynucleotide. Extension 
cycles are repeated until the end of the repeat region has been reached, or the desired number 
of repeats have been counted. 

5 

Fig. 5 shows an alternative tertiary complex wherein a tag complement oligonucleotide 
510 immobilized on a solid phase support 500 is specifically hybridized to a tagged sample 
polynucleotide 520. Polynucleotide 520 includes a tag segment 524 which is hybridized to 
oligonucleotide 510, and which is connected to a sample sequence 522. Sequence 522 includes 
M 10 a repeat region 528 which is flanked on either side by first and second sample segments 526 
R and 530. An extendable primer 540 is annealed to one of sample segments 526 and 530 to form 
m a target-primer hybrid such that the extendable end is adjacent to (or protrudes into) the sample 
jjl repeat region 528. The hybrid can be treated as above to extend the extendable end of the 
5 primer into the repeat region of the sample polynucleotide, until the end of the repeat region 
15 has been reached. 

l""z 
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m Tagged sample polynucleotides can be formed by any suitable method. Conveniently, 

S tagged samples can be formed by PCR amplification using primer pairs comprising first and 

second target-specific primers which flank each sample sequence of interest, wherein one of 
20 the probes contains an identifier tag segment. In one embodiment, the tagged primer is 
constructed so that the tag segment is not copied during amplification, either due to the 
presence of non-standard internucleoside linkages (e.g., PNA linkages), or due to the presence 
of a non-polynucleotide linker region which separates the tag segment from the target-binding 
segment, as can be prepared by standard synthetic methods. After the tag segment is 
25 hybridized to a tag complement on an addressable array, the non-tagged strand can be removed 
from the tagged strand, e.g., by elevated temperature and/or reducing salt concentration to 
destabilize DNA-DNA duplex structure, particularly if either the tag melts from the tag 
complement at a higher temperature than the melting temperature between the target-binding 
segment of the primer and the sample polynucleotide. Many other approaches for obtaining a 
30 selected strand from duplex nucleic acids are known in the art, and include for example, (1) 
the use of a biotinylated primer in PCR to enable capture and separation of the non-tagged 
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strand from the tagged strand under denaturing conditions, (2) the use of a PCR primer (non- 
tagged) that contains a short RNA segment which can be degraded with RNAse after PCR is 
complete, followed by destruction of the second RNAse degraded strand with an enzyme that 
has an exonuc lease activity (e.g., an appropriate DNAse), or an asymmetric PCR method 
5 which favors amplification of the tagged primer strand. 

Although Figs. 4 and 5 show particular embodiments, it will be apparent that other 
variations can be used in accordance with the methods of the invention. For example, with 
reference to Fig. 5, extendable primer 540 can be designed to bind instead to sample segment 

p 10 526, between the tag segment and the repeat region, such that the extendable end of the primer 

" is again adjacent to sample repeat region 528. 

ry The tagged primers or tagged sample polynucleotides can be hybridized to a tag 

~ complement array at any appropriate time during the primer extension process. For example, 

"!Z 15 the tagged primers of sample polynucleotides can be immobilized on the support prior to the 
hJ first contacting step in which the different-sequence primers and sample polynucleotides are 

p annealed to form target-primer hybrids. Alternatively, imm obilization can be performed after 

r the first primer extension reaction, or after the second primer extension reaction. In the latter 

case, a small aliquot of the extension reaction mixture can be removed after each extension 
20 cycle and hybridized to replicate arrays, such that each replicate array provides counting 

information after each cycle. 

IV. KITS FOR PRACTICING THE METHOD 

The present invention includes kits for carrying out the various aspects and embodiments 

25 of the methods of the invention. In a first aspect, kits of the invention include a primer, a first 
primer extension reagent, and a second primer extension reagent, wherein at least one of the first 
or second primer extension reagents includes an extendible nucleotide having a label attached 
thereto. Preferably, the label attached to the extendible nucleotide may be rendered undetectable 
following a treatment step effective to cleave the label from a primer extension product and/or 

30 destroy the signal producing properties of the label. Preferably, the primer is bound to a solid 
support, or, is capable of being bound to a solid support through a specific binding pair or 
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through a covalent linking moiety. In another preferred embodiment, this aspect of the invention 
includes a solid-phase support for attachment of a target-primer hybrid through the primer or the 
target nucleic acid. Optionally, the kits of this first aspect of the invention include a primer 
termination reagent. 

5 

In a second aspect, the kits of the invention include a primer, a first primer' extension 
reagent, a second primer extension reagent, and a primer termination reagent, wherein the 
primer termination reagent includes a nucleotide terminator having a label attached thereto. 
The second primer extension reagent and the primer termination reagent may be packaged either 
10 separately or together as a mixture. Preferably, the primer is bound to a solid support, or, is 
capable of being bound to a solid support through a specific binding pair or through a covalent 
linking moiety. In another preferred embodiment, this aspect of the invention includes a solid- 
phase support for attachment of a target-primer hybrid through the primer or the target nucleic 
acid. 

15 

In another embodiment, the kits can include (A) a plurality of different-sequence 
primers, each containing (i) a target binding segment and (ii) a tag segment having a nucleotide 
sequence that uniquely identifies the target binding segment; a first primer extension reagent; 
and a second primer extension reagent, wherein at least one of the first or second primer 
20 extension reagents includes an extendible nucleotide having a label attached thereto; and/or (B) 
an addressable array of immobilized, different tag complements, wherein each different tag 
complement contains a sequence that is complementary to one of the primer tag segments, 
under conditions effective to hybridize the tag segments to corresponding tag complements on 
the support. 

25 

The invention may be further understood in light of the following examples, which are not 
intended to limit the invention. 



Example 

30 The following study illustrates exemplary sample hybridization conditions and extension 

cycles for counting repeats in four different sample sequences using an array of target-specific 

capture oligonucleotides immobilized on a solid support. 
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A. Support . Glass microscope slides were immersed in 1 N HN0 3 for 1 to 2 hours, 
followed by rinsing with deionized water. Optionally, slides were soaked overnight in 5% 
HC1 to improve long-term stability of subsequent functionalizization. Slides were then 
5 sonicated sequentially in the following three solvents for 10 minutes each: hexane, acetone, 
and ethanol, followed by drying in air. 

A solution of 2% (v/v) aminopropyltriethoxy silane (0.8 mL in 40 mL) in 95% 
acetone:5% water was prepared in a disposable plastic Falcon tube. This amount of solution 
b 10 was usually sufficient to coat at least three slides. The solution was allowed to stand for 5 to 
H 20 minutes to hydrolyze the ethoxy groups to hydroxy 1 groups. Air-dried slides were dipped 

in the solution for 2 minutes each, and then rinsed by dipping in three or more successive 
acetone baths. 
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h 15 The slides were cured in 100°C oven for 45 minutes. Cured slides were treated for 2 

hours with a 0.2% solution of 1 ,4-phenylene diisothiocyanate (PDITC) in 10% 
pyridine: dimethyl formamide, followed by washes in baths of methanol and acetone, and air- 
drying. The slides may be stored under vacuum with a dessicant at 4°C. Stacking the slides 
also helps preserve the functionalization and keeps the functinalized surfaces free from 
20 particulates. (In an alternative embodiment, instead of 1 ,4-phenylene diisothiocyanate, the 
cured slides can be treated with a 1 mM solution of EMCS (N-(e-maleimido caproxyl) 
succinimide in methanol rdimethylsulfoxide (80:20) for 2 hours.) 

B. Immobilization of Capture Oligos . Synthetic capture oligonucleotides were 
25 prepared by standard phosphoramidite synthetic methods. The capture oligonucleotides had the 
following general structure: 5'-amino-[(PEO)J-[capture oligo (24 nt)], where PEO = -O- 
(CH 2 CH 2 0) 6 added via DMT-0-(CH 2 CH 2 0) 6 -phosphoramidite. The capture oligonucleotides 
had polynucleotide sequences complementary to the double-underlined sequences in the four 
sample sequences shown in section C below. 

30 
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The capture oligos were spotted in a rectangular array pattern onto slides prepared as 
above using an ASYMTEC robotic loader (Asymtec, Carlsbad, CA, Model No. C-708), with 
oligonucleotide concentrations of about 50 uM (in 100 mM Na,CG 3 , pH 9.0) and spotting 
volumes of about 0.2 to 1 nL. The pattern included a separate set of 8 spots (2x4 rows) for 
5 each different capture oligo to provide redundancy. The spots had diameters of about 200 
micrometers and were spaced apart by 320 micrometers center-to-center. The robotic loader 
included a slide holder to hold the slides during oligonucleotide deposition, and a blotting area 
for cleaning dispenser tips. 

5 10 After spotting, the capture oligos were fixed onto the slides by incubating the slides in a 

H humidity chamber at room temperature for 60 minutes, followed by soaking in 1% NH 4 OH 

5 (aq) for 20 minutes and in 0.2% casein in TBS (50 mM Tris, 150 mM NaCl, pH 8) for 20 

nJ minutes. The slides were washed in deionized water and stored at 4°C. 

O 

[Z 15 c. Sample Hybridization and Analysis . Four different sample oligos were prepared to 

5 represent four different microsatellite alleles. In brief, the sample polynucleotides included the 

□ following sequences, where smgle-underlining indicates repeat regions, and double underlining 

indicates segments for hybridizing to complementary capture oligos on the support: 

20 Sample Oligo 2260-2G (SEP ID NO: 1) 

5 ' -GTC A GG AC AC AAAGTG ATTTG ATGT AG ATTTTG A- 3 ' 

Sam ple Oligo 1411-1G (SEP ID NO:2) 

5 ' -GTCAGGACT GATAAAGTGTAAAAGTGTATGAT -3 ' 

25 

Sample Gligo 3149-2T (SEP ID NP:3) 

S'-GTGATT ACACTGTATGATAAAGGATTTTGATTGA -3' 

Sample Pligo 4223-4T (SEP ID NP:4 
30 S'■GTCATT ACACACACGTATTGATTTGATTGATTGAGATT -3 , 
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A mixture of the four sample oligos (0.33 uM each in IX PCR buffer containing 1.5 
mM MgCl 2 , 50 mM KC1, 1.5 mM MgCl 2 , 10 mM Tris-HCl, pH 8.3, and 0.001% (w/v) 
gelatin) was placed over the capture oligos on the array and allowed to incubate for 1 hour at 
room temperature. After the incubation, unbound oligos were removed with IX TE (10 mM 
Tris, pH 8, 1 mM EDTA) containing 50 mM NaCl. Replicate slides were subjected to 
different numbers of extension cycles (one, two, three or four cycles). Each extension cycle 
(also called a "reagent cycle") involved the following steps: 

1) Incubate slide for 4 minutes at 37°C in an extension mixture containing: 

500 uM dGTP 

20 uM Big Dye R6G ddATP (Perkin-Elmer, Foster City, CA, Emax 
20 vM Big Dye dRox ddCTP (Perkin-Elmer, Foster City, CA, Emax 
1 U/^L AMPLITAQ FS (plus 0.125 U/^L pyrophosphatase) 
IX buffer (80 mM Tris, pH 9:0, 2.5 mM MgCU) 

2) Rinse slide with IX TE containing 50 mM NaCl, then wash 3 times by immersion in same 
solution. 

3) Incubate slide for 4 minutes at 37° C in an extension mixture conta i ni n g: 

500 vM dATP 

20 uM Big Dye R6G ddATP (Perkin-Elmer, Foster City, CA, Emax 
20 pM Big Dye dRox ddCTP (Perkin-Elmer, Foster City, CA, Emax 
1 U/^L AMPLITAQ FS (plus 0.125 U/^L pyrophosphatase) 
IX buffer (80 mM Tris, pH 9.0, 2.5 mM MgCl^ 

4) Rinse slide with IX TE containing 50 mM NaCl, then wash 3 times by immersion in same 
solution. 

After 4 extension cycles had been completed (each cycle included steps 1 through 4), each 
slide was overlayed with a viewing buffer containing 50% (w:v) urea and IX TBE (0.09 M 



= 560 nm) 
= 615 nm) 



= 560 nm)- 
= 615 nm) 
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Tris-borate, 2 mM EDTA, pH - 8.3) to provide an alkaline pH environment to enhance the 
fluorescence emissions of the dye labels. 

The slides were each viewed using an imaging device set for fluorescence emission 
5 detection at 560 nm and 610 nm. The device was an imaging fluorimeter that produces a two- 
dimensional image array of emission intensities (electronic image). Each point in the image 
array corresponds to a physical location on the sample slide. The excitation source is a 40 mW 
argon ion laser. The excitation wavelength can be selected from either of two natural laser 
lines (488 nm and/or 514 nm). The selection of laser wavelengths is accomplished by passing 

10 the beam which contains light of both wavelengths through one of three filters. The selected 
filter will pass either 488 nm only, 514 nm only, or both 488 and 514 nm. The filters are 
mounted on a moter driven platform so that the selection can be performed under computer 
control. The beam containing either or both wavelengths passes through a classical beam 
expander. The collimated expanded beam is then directed into a commercially available 

15 assembly (Scanlab, Puchhein, Germany, Model SK1020) containing two orthogonal "galvo 
mirrors n . This is a mechanical device designed to rotate each of the two mirrors quickly and 
precisely over a small angle. The axes of rotation are orthoganal and independent so that the 
beam can be rastered over a rectangular pattern, also under computer control. 

20 The scanned beam is focused by an f-theta lens which forms a scanned laser spot at one 

focal length. The spot size is 20 jam diameter. Typically, it is stepped over an area of 20 mm 
x 20 mm in increments of 20 jam, thereby forming a 1000 x 1000 array of pixels. The focused 
laser spot is formed after passing through a dichroic beam splitter which is designed to 
reflected the laser excitation wavelengths but transmit the fluorophore emission. 

25 

As the laser beam steps across a fluorescent region, the fraction of fluorescence 
radiation within the solid angle of collection of the emission optics is transmitted by the 
dichroic mirror and is collimated by a lens having a 150 mm focal length. The collimated 
beam is then filtered by a long pass interference filter which further rejects laser light. A 
30 second lens of 75 mm focal length focuses the beam onto the photocathode of a red sensitive 
photomultiplier tube (PMT). The PMT photocathode does not need to be carefully adjusted 
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since the focus is not critical. A filter wheel in front of the PMT allows only a small 
wavelength band (10 nm) to reach the detector. The filter wheel can be adjusted to one of six 
positions to allow for emission of multiple fluorophores to be discriminated. 

The electrical - output of the PMT is proportional to the intensity of the light reaching 
the photocathode. The computer system is capable of storing the signals at each of an array of 
mirror positions, to form the electronic image. 

The results were as expected. Specifically, for oligo 1 (SEQ ID NO:l), no signal was 
observed until completion of the second cycle, after which two GT dimers and a fluorescence- 
labeled ddC terminator had been appended to the immobilized capture oligo, which also served 
as the extendable primer. Similarly, for oligos 2 through 4 (SEQ ID NO:2 through 4), the 
appropriate fluorescent signal was observed after 1, 2 and 4 extension cycles, respectively. 

All publications and patent applications refered to in this disclosure are herein 
incorporated by reference to the same extent as if each individual publication or patent 
application was specifically and individually indicated to be incorporated by reference. 

Although only a few embodiments have been described in detail above, those having 
ordinary skill in the molecular biology art will clearly understand that many modifications are 
possible in the preferred embodiments without departing from the teachings thereof. 
Accordingly, all such modifications are intended to be encompassed within the scope of the 
following claims. 
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